Overview
Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 1031175 |
| Missing cells | 72276 |
| Missing cells (%) | 0.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.2 GiB |
| Average record size in memory | 1.2 KiB |
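The derived figures in this table can be cross-checked from the raw counts. A minimal sketch, assuming the missing-cell percentage is taken over all cells (rows × variables) and that GiB and KiB are binary units; the variable names are illustrative only:

```python
# Raw figures copied from the dataset statistics table.
n_variables = 19
n_observations = 1_031_175
missing_cells = 72_276
total_memory_gib = 1.2  # rounded value as reported

# Assumption: the percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: binary units (1 GiB = 2**30 bytes, 1 KiB = 2**10 bytes).
avg_record_kib = total_memory_gib * 2**30 / n_observations / 2**10

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_kib:.1f} KiB")
```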
Variable types
| Numeric | 5 |
|---|---|
| Text | 10 |
| URL | 3 |
| Categorical | 1 |
Alerts
| Language is highly imbalanced (78.5%) | Imbalance |
|---|---|
| city has 14103 (1.4%) missing values | Missing |
| state has 22798 (2.2%) missing values | Missing |
| country has 35374 (3.4%) missing values | Missing |
| Unnamed: 0 is uniformly distributed | Uniform |
| Unnamed: 0 has unique values | Unique |
| rating has 647323 (62.8%) zeros | Zeros |
Reproduction
| Analysis started | 2025-02-26 11:12:58.857438 |
|---|---|
| Analysis finished | 2025-02-26 11:14:28.434495 |
| Duration | 1 minute and 29.58 seconds |
| Software version | ydata-profiling v4.12.2 |
| Download configuration | config.json |
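The reported duration follows directly from the two timestamps. A small check using only the standard library, with the timestamps copied from the table:

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2025-02-26 11:12:58.857438")
finished = datetime.fromisoformat("2025-02-26 11:14:28.434495")

elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)

print(f"{int(minutes)} minute and {seconds:.2f} seconds")
```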
Variables
Unnamed: 0
Real number (ℝ)
Uniform  Unique 
| Distinct | 1031175 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 515587 |
| Minimum | 0 |
|---|---|
| Maximum | 1031174 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 51558.7 |
| Q1 | 257793.5 |
| median | 515587 |
| Q3 | 773380.5 |
| 95-th percentile | 979615.3 |
| Maximum | 1031174 |
| Range | 1031174 |
| Interquartile range (IQR) | 515587 |
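Because `Unnamed: 0` takes every integer from 0 to 1031174 exactly once, its quantiles have a closed form. The sketch below assumes linear interpolation between order statistics (the default in NumPy and pandas); the helper name `index_quantile` is hypothetical:

```python
def index_quantile(q: float, n_rows: int) -> float:
    """Quantile of the integers 0..n_rows-1 under linear interpolation."""
    # For sorted values 0..n_rows-1, the value at fractional position
    # q * (n_rows - 1) is that position itself.
    return q * (n_rows - 1)


n = 1_031_175  # number of observations in the report

quantiles = {q: index_quantile(q, n) for q in (0.05, 0.25, 0.5, 0.75, 0.95)}
iqr = quantiles[0.75] - quantiles[0.25]

for q, value in quantiles.items():
    print(f"{q:.2f}: {value:.1f}")
print(f"IQR: {iqr:.1f}")
```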
Descriptive statistics
| Standard deviation | 297674.73 |
|---|---|
| Coefficient of variation (CV) | 0.57735111 |
| Kurtosis | -1.2 |
| Mean | 515587 |
| Median Absolute Deviation (MAD) | 257794 |
| Skewness | 1.4842383 × 10⁻¹⁵ |
| Sum | 5.3166042 × 10¹¹ |
| Variance | 8.8610243 × 10¹⁰ |
| Monotonicity | Strictly increasing |
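These descriptive statistics are what one would expect from a plain row index 0..n-1. A sketch of the closed-form values, assuming the variance uses the sample (n-1) denominator; the kurtosis line uses the population formula for a discrete uniform distribution, which is indistinguishable from a bias-corrected estimate at this sample size:

```python
import math

n = 1_031_175  # number of observations

total = n * (n - 1) // 2      # sum of 0..n-1
mean = (n - 1) / 2
variance = n * (n + 1) / 12   # sample variance of 0..n-1 (n-1 denominator)
std = math.sqrt(variance)
cv = std / mean               # approaches 1/sqrt(3) for large n

# Population excess kurtosis of a discrete uniform distribution on n points.
kurtosis = -6 * (n**2 + 1) / (5 * (n**2 - 1))

print(f"sum={total:.7e} mean={mean} variance={variance:.7e}")
print(f"std={std:.2f} cv={cv:.8f} kurtosis={kurtosis:.1f}")
```

The skewness of such a column is exactly zero by symmetry, so the reported value of about 10⁻¹⁵ is floating-point noise.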
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 687455 | 1 | < 0.1% |
| 687442 | 1 | < 0.1% |
| 687443 | 1 | < 0.1% |
| 687444 | 1 | < 0.1% |
| 687445 | 1 | < 0.1% |
| 687446 | 1 | < 0.1% |
| 687447 | 1 | < 0.1% |
| 687448 | 1 | < 0.1% |
| 687449 | 1 | < 0.1% |
| Other values (1031165) | 1031165 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1031174 | 1 | < 0.1% |
| 1031173 | 1 | < 0.1% |
| 1031172 | 1 | < 0.1% |
| 1031171 | 1 | < 0.1% |
| 1031170 | 1 | < 0.1% |
| 1031169 | 1 | < 0.1% |
| 1031168 | 1 | < 0.1% |
| 1031167 | 1 | < 0.1% |
| 1031166 | 1 | < 0.1% |
| 1031165 | 1 | < 0.1% |
user_id
Real number (ℝ)
| Distinct | 92107 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 140594.37 |
| Minimum | 2 |
|---|---|
| Maximum | 278854 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 14422 |
| Q1 | 70415 |
| median | 141210 |
| Q3 | 211426 |
| 95-th percentile | 264331.7 |
| Maximum | 278854 |
| Range | 278852 |
| Interquartile range (IQR) | 141011 |
Descriptive statistics
| Standard deviation | 80524.435 |
|---|---|
| Coefficient of variation (CV) | 0.57274294 |
| Kurtosis | -1.2265592 |
| Mean | 140594.37 |
| Median Absolute Deviation (MAD) | 70616 |
| Skewness | -0.023981499 |
| Sum | 1.449774 × 10¹¹ |
| Variance | 6.4841846 × 10⁹ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11676 | 11144 | 1.1% |
| 198711 | 6456 | 0.6% |
| 153662 | 5814 | 0.6% |
| 98391 | 5779 | 0.6% |
| 35859 | 5646 | 0.5% |
| 212898 | 4290 | 0.4% |
| 278418 | 3996 | 0.4% |
| 76352 | 3329 | 0.3% |
| 110973 | 2971 | 0.3% |
| 235105 | 2943 | 0.3% |
| Other values (92097) | 978807 |
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 8 | 17 | < 0.1% |
| 9 | 3 | < 0.1% |
| 10 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 14 | 4 | < 0.1% |
| 16 | 2 | < 0.1% |
| 17 | 7 | < 0.1% |
| 19 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 278854 | 8 | < 0.1% |
| 278852 | 1 | < 0.1% |
| 278851 | 23 | < 0.1% |
| 278849 | 4 | < 0.1% |
| 278846 | 1 | < 0.1% |
| 278844 | 2 | < 0.1% |
| 278843 | 60 | < 0.1% |
| 278838 | 6 | < 0.1% |
| 278836 | 1 | < 0.1% |
| 278832 | 3 | < 0.1% |
location
Text
| Distinct | 22480 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 81.6 MiB |
Length
| Max length | 69 |
|---|---|
| Median length | 59 |
| Mean length | 25.180217 |
| Min length | 3 |
Unique
| Unique | 8399 |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | stockton, california, usa |
|---|---|
| 2nd row | timmins, ontario, canada |
| 3rd row | ottawa, ontario, canada |
| 4th row | n/a, n/a, n/a |
| 5th row | sudbury, ontario, canada |
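The sample rows suggest a `city, state, country` layout with `n/a` as a placeholder for unknown parts. The sketch below splits such a string under that assumption; the function name `split_location` and the placeholder set are hypothetical, and values with fewer or more than three comma-separated parts are handled by a simple convention (last part is the country, second to last is the state, the rest is the city):

```python
from typing import Optional, Tuple

# Assumption: these strings mean "unknown" in the location column.
PLACEHOLDERS = {"", "n/a"}


def split_location(location: str) -> Tuple[Optional[str], Optional[str], Optional[str]]:
    """Split 'city, state, country' into its parts, mapping placeholders to None."""
    parts = [part.strip() for part in location.split(",")]
    # Pad short values at the front so the last part is always the country.
    while len(parts) < 3:
        parts.insert(0, "")
    city = ", ".join(parts[:-2])
    state, country = parts[-2], parts[-1]

    def clean(value: str) -> Optional[str]:
        return None if value.lower() in PLACEHOLDERS else value

    return clean(city), clean(state), clean(country)


print(split_location("stockton, california, usa"))
print(split_location("n/a, n/a, n/a"))
```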
| Value | Count | Frequency (%) |
| usa | 746584 | 21.1% |
| california | 107540 | 3.0% |
| canada | 99471 | 2.8% |
| new | 87325 | 2.5% |
| n/a | 44526 | 1.3% |
| texas | 44185 | 1.2% |
| ontario | 41505 | 1.2% |
| york | 37561 | 1.1% |
| virginia | 36906 | 1.0% |
| florida | 34285 | 1.0% |
| Other values (14214) | 2260285 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3266628 | 12.6% |
| (space) | 2509400 | 9.7% |
| , | 2065908 | 8.0% |
| n | 1904641 | 7.3% |
| s | 1851542 | 7.1% |
| i | 1701268 | 6.6% |
| o | 1549232 | 6.0% |
| e | 1448319 | 5.6% |
| r | 1307523 | 5.0% |
| u | 1210337 | 4.7% |
| Other values (81) | 7150412 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 25965210 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 25965210 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 25965210 |
age
Real number (ℝ)
| Distinct | 93 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.429017 |
| Minimum | 5 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 MiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 22 |
| Q1 | 31 |
| median | 34.7439 |
| Q3 | 41 |
| 95-th percentile | 57 |
| Maximum | 99 |
| Range | 94 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 10.353539 |
|---|---|
| Coefficient of variation (CV) | 0.28421133 |
| Kurtosis | 1.2230354 |
| Mean | 36.429017 |
| Median Absolute Deviation (MAD) | 4.7438999 |
| Skewness | 0.76586581 |
| Sum | 37564691 |
| Variance | 107.19578 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 34.74389988 | 282595 | 27.4% |
| 33 | 32864 | 3.2% |
| 29 | 30648 | 3.0% |
| 30 | 27202 | 2.6% |
| 32 | 26492 | 2.6% |
| 36 | 26097 | 2.5% |
| 28 | 25967 | 2.5% |
| 31 | 25965 | 2.5% |
| 34 | 25893 | 2.5% |
| 38 | 22396 | 2.2% |
| Other values (83) | 505056 |
| Value | Count | Frequency (%) |
| 5 | 159 | < 0.1% |
| 6 | 14 | < 0.1% |
| 7 | 148 | < 0.1% |
| 8 | 542 | 0.1% |
| 9 | 2056 | 0.2% |
| 10 | 227 | < 0.1% |
| 11 | 513 | < 0.1% |
| 12 | 747 | 0.1% |
| 13 | 1243 | 0.1% |
| 14 | 3206 | 0.3% |
| Value | Count | Frequency (%) |
| 99 | 5 | < 0.1% |
| 98 | 1 | < 0.1% |
| 97 | 127 | < 0.1% |
| 96 | 13 | < 0.1% |
| 95 | 1 | < 0.1% |
| 94 | 1 | < 0.1% |
| 93 | 34 | < 0.1% |
| 92 | 47 | < 0.1% |
| 90 | 45 | < 0.1% |
| 89 | 2 | < 0.1% |
isbn
Text
| Distinct | 270170 |
|---|---|
| Distinct (%) | 26.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.9 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 10.000012 |
| Min length | 10 |
Unique
| Unique | 145658 |
|---|---|
| Unique (%) | 14.1% |
Sample
| 1st row | 0195153448 |
|---|---|
| 2nd row | 0002005018 |
| 3rd row | 0002005018 |
| 4th row | 0002005018 |
| 5th row | 0002005018 |
| Value | Count | Frequency (%) |
| 0971880107 | 2502 | 0.2% |
| 0316666343 | 1295 | 0.1% |
| 0385504209 | 883 | 0.1% |
| 0060928336 | 732 | 0.1% |
| 0312195516 | 723 | 0.1% |
| 044023722x | 649 | 0.1% |
| 067976402x | 618 | 0.1% |
| 0142001740 | 615 | 0.1% |
| 0671027360 | 586 | 0.1% |
| 0446672211 | 585 | 0.1% |
| Other values (269853) | 1021990 |
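Most values in this column are ten characters long, and some end in `x`, which matches the ISBN-10 format where the final check character may be `X` (value 10). The standard ISBN-10 checksum can be sketched as follows; this is the general rule, not something the report itself computes, and the function name is hypothetical:

```python
def is_valid_isbn10(isbn: str) -> bool:
    """Return True if the string passes the ISBN-10 weighted checksum."""
    isbn = isbn.strip().upper()
    if len(isbn) != 10:
        return False
    total = 0
    for position, char in enumerate(isbn):
        if char == "X" and position == 9:
            value = 10  # 'X' is only allowed as the check character
        elif char in "0123456789":
            value = int(char)
        else:
            return False
        # Weights run from 10 for the first character down to 1 for the last.
        total += (10 - position) * value
    return total % 11 == 0


for candidate in ("0195153448", "0002005018", "044023722x"):
    print(candidate, is_valid_isbn10(candidate))
```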
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1932130 | 18.7% |
| 4 | 1082823 | 10.5% |
| 1 | 1062075 | 10.3% |
| 5 | 1038144 | 10.1% |
| 3 | 1034312 | 10.0% |
| 2 | 876562 | 8.5% |
| 7 | 861219 | 8.4% |
| 6 | 841552 | 8.2% |
| 8 | 814482 | 7.9% |
| 9 | 682550 | 6.6% |
| Other values (29) | 85913 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10311762 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10311762 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10311762 |
rating
Real number (ℝ)
Zeros 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.8390215 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 647323 |
| Zeros (%) | 62.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 7 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 3.8541492 |
|---|---|
| Coefficient of variation (CV) | 1.3575625 |
| Kurtosis | -1.2150103 |
| Mean | 2.8390215 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.75243535 |
| Sum | 2927528 |
| Variance | 14.854466 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 647323 | 62.8% |
| 8 | 91806 | 8.9% |
| 10 | 71227 | 6.9% |
| 7 | 66404 | 6.4% |
| 9 | 60780 | 5.9% |
| 5 | 45355 | 4.4% |
| 6 | 31689 | 3.1% |
| 4 | 7617 | 0.7% |
| 3 | 5118 | 0.5% |
| 2 | 2375 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 647323 | 62.8% |
| 1 | 1481 | 0.1% |
| 2 | 2375 | 0.2% |
| 3 | 5118 | 0.5% |
| 4 | 7617 | 0.7% |
| 5 | 45355 | 4.4% |
| 6 | 31689 | 3.1% |
| 7 | 66404 | 6.4% |
| 8 | 91806 | 8.9% |
| 9 | 60780 | 5.9% |
| Value | Count | Frequency (%) |
| 10 | 71227 | 6.9% |
| 9 | 60780 | 5.9% |
| 8 | 91806 | 8.9% |
| 7 | 66404 | 6.4% |
| 6 | 31689 | 3.1% |
| 5 | 45355 | 4.4% |
| 4 | 7617 | 0.7% |
| 3 | 5118 | 0.5% |
| 2 | 2375 | 0.2% |
| 1 | 1481 | 0.1% |
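The summary statistics for `rating` can be rebuilt from the value counts above, which is a useful consistency check. The sketch assumes the sample (n-1) variance; the mean over non-zero ratings is included because zeros make up most of the column, but whether a zero means "no explicit rating" is an assumption the report does not confirm:

```python
# Value counts copied from the rating tables.
rating_counts = {
    0: 647_323, 1: 1_481, 2: 2_375, 3: 5_118, 4: 7_617, 5: 45_355,
    6: 31_689, 7: 66_404, 8: 91_806, 9: 60_780, 10: 71_227,
}

n = sum(rating_counts.values())
total = sum(value * count for value, count in rating_counts.items())
sum_sq = sum(value**2 * count for value, count in rating_counts.items())

mean = total / n
variance = (sum_sq - n * mean**2) / (n - 1)  # sample variance
zero_share = rating_counts[0] / n

# Mean over rows with a non-zero rating only.
nonzero_mean = total / (n - rating_counts[0])

print(f"n={n} sum={total} mean={mean:.7f} variance={variance:.6f}")
print(f"zeros={100 * zero_share:.1f}% nonzero mean={nonzero_mean:.2f}")
```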
book_title
Text
| Distinct | 241090 |
|---|---|
| Distinct (%) | 23.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 89.1 MiB |
Length
| Max length | 256 |
|---|---|
| Median length | 211 |
| Mean length | 32.71261 |
| Min length | 1 |
Unique
| Unique | 127524 |
|---|---|
| Unique (%) | 12.4% |
Sample
| 1st row | Classical Mythology |
|---|---|
| 2nd row | Clara Callan |
| 3rd row | Clara Callan |
| 4th row | Clara Callan |
| 5th row | Clara Callan |
| Value | Count | Frequency (%) |
| the | 461771 | 8.2% |
| of | 220678 | 3.9% |
| a | 174061 | 3.1% |
| and | 100167 | 1.8% |
|  | 85788 | 1.5% |
| in | 63123 | 1.1% |
| to | 60517 | 1.1% |
| novel | 54387 | 1.0% |
| book | 46932 | 0.8% |
| for | 41821 | 0.7% |
| Other values (93219) | 4296248 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 4585156 | 13.6% |
| e | 3228366 | 9.6% |
| o | 1999873 | 5.9% |
| a | 1855355 | 5.5% |
| r | 1765522 | 5.2% |
| i | 1721233 | 5.1% |
| n | 1681241 | 5.0% |
| t | 1557566 | 4.6% |
| s | 1387806 | 4.1% |
| l | 1084723 | 3.2% |
| Other values (116) | 12865585 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 33732426 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 33732426 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 33732426 |
book_author
Text
| Distinct | 101593 |
|---|---|
| Distinct (%) | 9.9% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 69.7 MiB |
Length
| Max length | 143 |
|---|---|
| Median length | 70 |
| Mean length | 13.751096 |
| Min length | 1 |
Unique
| Unique | 49280 |
|---|---|
| Unique (%) | 4.8% |
Sample
| 1st row | Mark P. O. Morford |
|---|---|
| 2nd row | Richard Bruce Wright |
| 3rd row | Richard Bruce Wright |
| 4th row | Richard Bruce Wright |
| 5th row | Richard Bruce Wright |
| Value | Count | Frequency (%) |
| john | 32558 | 1.4% |
| james | 20384 | 0.9% |
| robert | 18457 | 0.8% |
| michael | 16959 | 0.8% |
| stephen | 16524 | 0.7% |
| r | 15345 | 0.7% |
| david | 14916 | 0.7% |
| j | 14678 | 0.6% |
| anne | 14218 | 0.6% |
| mary | 12185 | 0.5% |
| Other values (49131) | 2082249 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1263261 | 8.9% |
| (space) | 1230224 | 8.7% |
| a | 1160642 | 8.2% |
| n | 967461 | 6.8% |
| r | 933572 | 6.6% |
| i | 769864 | 5.4% |
| o | 690693 | 4.9% |
| l | 668913 | 4.7% |
| t | 497532 | 3.5% |
| s | 459031 | 3.2% |
| Other values (99) | 5538580 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 14179773 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 14179773 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 14179773 |
year_of_publication
Real number (ℝ)
| Distinct | 104 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1995.2827 |
| Minimum | 1376 |
|---|---|
| Maximum | 2008 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 MiB |
Quantile statistics
| Minimum | 1376 |
|---|---|
| 5-th percentile | 1982 |
| Q1 | 1992 |
| median | 1997 |
| Q3 | 2001 |
| 95-th percentile | 2003 |
| Maximum | 2008 |
| Range | 632 |
| Interquartile range (IQR) | 9 |
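The range of 632 years against an IQR of 9 points to a few extreme values. One common way to flag them is Tukey's 1.5 × IQR fences; this rule is not part of the report and is shown here only as a sketch, with a hypothetical function name:

```python
from typing import Tuple


def tukey_fences(q1: float, q3: float, k: float = 1.5) -> Tuple[float, float]:
    """Return the lower and upper fences q1 - k*IQR and q3 + k*IQR."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Quartiles copied from the quantile statistics table.
lower, upper = tukey_fences(1992, 2001)
print(f"fences: {lower} to {upper}")

# Minimum and maximum as reported for year_of_publication.
for year in (1376, 2008):
    status = "outside" if not (lower <= year <= upper) else "inside"
    print(year, status)
```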
Descriptive statistics
| Standard deviation | 7.3093401 |
|---|---|
| Coefficient of variation (CV) | 0.0036633106 |
| Kurtosis | 107.24842 |
| Mean | 1995.2827 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -3.0531031 |
| Sum | 2.0574856 × 10⁹ |
| Variance | 53.426453 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2002 | 91801 | 8.9% |
| 2001 | 79803 | 7.7% |
| 1999 | 75195 | 7.3% |
| 2003 | 72539 | 7.0% |
| 2000 | 72334 | 7.0% |
| 1998 | 64209 | 6.2% |
| 1994 | 60533 | 5.9% |
| 1997 | 59361 | 5.8% |
| 1996 | 58826 | 5.7% |
| 1995 | 54093 | 5.2% |
| Other values (94) | 342481 |
| Value | Count | Frequency (%) |
| 1376 | 1 | < 0.1% |
| 1378 | 1 | < 0.1% |
| 1806 | 1 | < 0.1% |
| 1897 | 1 | < 0.1% |
| 1900 | 4 | < 0.1% |
| 1901 | 7 | < 0.1% |
| 1902 | 10 | < 0.1% |
| 1904 | 1 | < 0.1% |
| 1906 | 1 | < 0.1% |
| 1908 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 2008 | 1 | < 0.1% |
| 2006 | 3 | < 0.1% |
| 2005 | 122 | < 0.1% |
| 2004 | 25971 | 2.5% |
| 2003 | 72539 | 7.0% |
| 2002 | 91801 | 8.9% |
| 2001 | 79803 | 7.7% |
| 2000 | 72334 | 7.0% |
| 1999 | 75195 | 7.3% |
| 1998 | 64209 | 6.2% |
publisher
Text
| Distinct | 16729 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.0 MiB |
Length
| Max length | 134 |
|---|---|
| Median length | 88 |
| Mean length | 14.088498 |
| Min length | 1 |
Unique
| Unique | 7153 |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | Oxford University Press |
|---|---|
| 2nd row | HarperFlamingo Canada |
| 3rd row | HarperFlamingo Canada |
| 4th row | HarperFlamingo Canada |
| 5th row | HarperFlamingo Canada |
| Value | Count | Frequency (%) |
| books | 289588 | 13.3% |
| publishing | 74377 | 3.4% |
| bantam | 53309 | 2.4% |
| press | 51201 | 2.4% |
| group | 49822 | 2.3% |
|  | 39280 | 1.8% |
|  | 38651 | 1.8% |
| dell | 35068 | 1.6% |
| ballantine | 34864 | 1.6% |
| warner | 33717 | 1.5% |
| Other values (11400) | 1478364 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1312896 | 9.0% |
| (space) | 1147070 | 7.9% |
| e | 1061244 | 7.3% |
| n | 930394 | 6.4% |
| a | 895126 | 6.2% |
| r | 884118 | 6.1% |
| s | 852276 | 5.9% |
| i | 784350 | 5.4% |
| l | 657268 | 4.5% |
| t | 589550 | 4.1% |
| Other values (105) | 5413415 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 14527707 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 14527707 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 14527707 |
img_s
URL
| Distinct | 269861 |
|---|---|
| Distinct (%) | 26.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 115.1 MiB |
| Value | Count | Frequency (%) |
| http://images.amazon.com/images/P/0971880107.01.THUMBZZZ.jpg | 2502 | 0.2% |
| http://images.amazon.com/images/P/0316666343.01.THUMBZZZ.jpg | 1295 | 0.1% |
| http://images.amazon.com/images/P/0385504209.01.THUMBZZZ.jpg | 883 | 0.1% |
| http://images.amazon.com/images/P/0060928336.01.THUMBZZZ.jpg | 732 | 0.1% |
| http://images.amazon.com/images/P/0312195516.01.THUMBZZZ.jpg | 723 | 0.1% |
| http://images.amazon.com/images/P/044023722X.01.THUMBZZZ.jpg | 649 | 0.1% |
| http://images.amazon.com/images/P/067976402X.01.THUMBZZZ.jpg | 618 | 0.1% |
| http://images.amazon.com/images/P/0142001740.01.THUMBZZZ.jpg | 615 | 0.1% |
| http://images.amazon.com/images/P/0671027360.01.THUMBZZZ.jpg | 586 | 0.1% |
| http://images.amazon.com/images/P/0446672211.01.THUMBZZZ.jpg | 585 | 0.1% |
| Other values (269851) | 1021987 |
Scheme
| Value | Count | Frequency (%) |
| http | 1031175 | 100.0% |
Netloc
| Value | Count | Frequency (%) |
| images.amazon.com | 1031175 | 100.0% |
Path
| Value | Count | Frequency (%) |
| /images/P/0971880107.01.THUMBZZZ.jpg | 2502 | 0.2% |
| /images/P/0316666343.01.THUMBZZZ.jpg | 1295 | 0.1% |
| /images/P/0385504209.01.THUMBZZZ.jpg | 883 | 0.1% |
| /images/P/0060928336.01.THUMBZZZ.jpg | 732 | 0.1% |
| /images/P/0312195516.01.THUMBZZZ.jpg | 723 | 0.1% |
| /images/P/044023722X.01.THUMBZZZ.jpg | 649 | 0.1% |
| /images/P/067976402X.01.THUMBZZZ.jpg | 618 | 0.1% |
| /images/P/0142001740.01.THUMBZZZ.jpg | 615 | 0.1% |
| /images/P/0671027360.01.THUMBZZZ.jpg | 586 | 0.1% |
| /images/P/0446672211.01.THUMBZZZ.jpg | 585 | 0.1% |
| Other values (269851) | 1021987 |
Query
| Value | Count | Frequency (%) |
| (empty) | 1031175 | 100.0% |
Fragment
| Value | Count | Frequency (%) |
| (empty) | 1031175 | 100.0% |
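The image URLs appear to follow a fixed pattern built from the ISBN (in upper case) and a size suffix. The sketch below rebuilds a URL under that inferred pattern and splits it with the standard library; the helper `cover_url` and the size keys are hypothetical names, and the pattern is an inference from the rows shown, not a documented scheme:

```python
from urllib.parse import urlsplit

# Suffixes as they appear in the img_s, img_m and img_l columns.
SIZE_SUFFIX = {"s": "THUMBZZZ", "m": "MZZZZZZZ", "l": "LZZZZZZZ"}


def cover_url(isbn: str, size: str) -> str:
    """Build an image URL following the pattern seen in the report."""
    return f"http://images.amazon.com/images/P/{isbn.upper()}.01.{SIZE_SUFFIX[size]}.jpg"


url = cover_url("044023722x", "s")
parts = urlsplit(url)

print(url)
print("scheme:", parts.scheme)
print("netloc:", parts.netloc)
print("path:", parts.path)
print("query:", repr(parts.query), "fragment:", repr(parts.fragment))
```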
img_m
URL
| Distinct | 269861 |
|---|---|
| Distinct (%) | 26.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 115.1 MiB |
| Value | Count | Frequency (%) |
| http://images.amazon.com/images/P/0971880107.01.MZZZZZZZ.jpg | 2502 | 0.2% |
| http://images.amazon.com/images/P/0316666343.01.MZZZZZZZ.jpg | 1295 | 0.1% |
| http://images.amazon.com/images/P/0385504209.01.MZZZZZZZ.jpg | 883 | 0.1% |
| http://images.amazon.com/images/P/0060928336.01.MZZZZZZZ.jpg | 732 | 0.1% |
| http://images.amazon.com/images/P/0312195516.01.MZZZZZZZ.jpg | 723 | 0.1% |
| http://images.amazon.com/images/P/044023722X.01.MZZZZZZZ.jpg | 649 | 0.1% |
| http://images.amazon.com/images/P/067976402X.01.MZZZZZZZ.jpg | 618 | 0.1% |
| http://images.amazon.com/images/P/0142001740.01.MZZZZZZZ.jpg | 615 | 0.1% |
| http://images.amazon.com/images/P/0671027360.01.MZZZZZZZ.jpg | 586 | 0.1% |
| http://images.amazon.com/images/P/0446672211.01.MZZZZZZZ.jpg | 585 | 0.1% |
| Other values (269851) | 1021987 |
Scheme
| Value | Count | Frequency (%) |
| http | 1031175 | 100.0% |
Netloc
| Value | Count | Frequency (%) |
| images.amazon.com | 1031175 | 100.0% |
Path
| Value | Count | Frequency (%) |
| /images/P/0971880107.01.MZZZZZZZ.jpg | 2502 | 0.2% |
| /images/P/0316666343.01.MZZZZZZZ.jpg | 1295 | 0.1% |
| /images/P/0385504209.01.MZZZZZZZ.jpg | 883 | 0.1% |
| /images/P/0060928336.01.MZZZZZZZ.jpg | 732 | 0.1% |
| /images/P/0312195516.01.MZZZZZZZ.jpg | 723 | 0.1% |
| /images/P/044023722X.01.MZZZZZZZ.jpg | 649 | 0.1% |
| /images/P/067976402X.01.MZZZZZZZ.jpg | 618 | 0.1% |
| /images/P/0142001740.01.MZZZZZZZ.jpg | 615 | 0.1% |
| /images/P/0671027360.01.MZZZZZZZ.jpg | 586 | 0.1% |
| /images/P/0446672211.01.MZZZZZZZ.jpg | 585 | 0.1% |
| Other values (269851) | 1021987 |
Query
| Value | Count | Frequency (%) |
| (empty) | 1031175 | 100.0% |
Fragment
| Value | Count | Frequency (%) |
| (empty) | 1031175 | 100.0% |
img_l
URL
| Distinct | 269861 |
|---|---|
| Distinct (%) | 26.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 115.1 MiB |
| Value | Count | Frequency (%) |
| http://images.amazon.com/images/P/0971880107.01.LZZZZZZZ.jpg | 2502 | 0.2% |
| http://images.amazon.com/images/P/0316666343.01.LZZZZZZZ.jpg | 1295 | 0.1% |
| http://images.amazon.com/images/P/0385504209.01.LZZZZZZZ.jpg | 883 | 0.1% |
| http://images.amazon.com/images/P/0060928336.01.LZZZZZZZ.jpg | 732 | 0.1% |
| http://images.amazon.com/images/P/0312195516.01.LZZZZZZZ.jpg | 723 | 0.1% |
| http://images.amazon.com/images/P/044023722X.01.LZZZZZZZ.jpg | 649 | 0.1% |
| http://images.amazon.com/images/P/067976402X.01.LZZZZZZZ.jpg | 618 | 0.1% |
| http://images.amazon.com/images/P/0142001740.01.LZZZZZZZ.jpg | 615 | 0.1% |
| http://images.amazon.com/images/P/0671027360.01.LZZZZZZZ.jpg | 586 | 0.1% |
| http://images.amazon.com/images/P/0446672211.01.LZZZZZZZ.jpg | 585 | 0.1% |
| Other values (269851) | 1021987 |
Scheme
| Value | Count | Frequency (%) |
| http | 1031175 | 100.0% |
Netloc
| Value | Count | Frequency (%) |
| images.amazon.com | 1031175 | 100.0% |
Path
| Value | Count | Frequency (%) |
| /images/P/0971880107.01.LZZZZZZZ.jpg | 2502 | 0.2% |
| /images/P/0316666343.01.LZZZZZZZ.jpg | 1295 | 0.1% |
| /images/P/0385504209.01.LZZZZZZZ.jpg | 883 | 0.1% |
| /images/P/0060928336.01.LZZZZZZZ.jpg | 732 | 0.1% |
| /images/P/0312195516.01.LZZZZZZZ.jpg | 723 | 0.1% |
| /images/P/044023722X.01.LZZZZZZZ.jpg | 649 | 0.1% |
| /images/P/067976402X.01.LZZZZZZZ.jpg | 618 | 0.1% |
| /images/P/0142001740.01.LZZZZZZZ.jpg | 615 | 0.1% |
| /images/P/0671027360.01.LZZZZZZZ.jpg | 586 | 0.1% |
| /images/P/0446672211.01.LZZZZZZZ.jpg | 585 | 0.1% |
| Other values (269851) | 1021987 |
Query
| Value | Count | Frequency (%) |
| (empty) | 1031175 | 100.0% |
Fragment
| Value | Count | Frequency (%) |
| (empty) | 1031175 | 100.0% |
Summary
Text
| Distinct | 136911 |
|---|---|
| Distinct (%) | 13.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 169.9 MiB |
Length
| Max length | 374 |
|---|---|
| Median length | 297 |
| Mean length | 109.08555 |
| Min length | 1 |
Unique
| Unique | 66037 |
|---|---|
| Unique (%) | 6.4% |
Sample
| 1st row | Provides an introduction to classical myths placing the addressed topics within their historical context, discussion of archaeological evidence as support for mythical events, and how these themes have been portrayed in literature, art, ... |
|---|---|
| 2nd row | In a small town in Canada, Clara Callan reluctantly takes leave of her sister, Nora, who is bound for New York. |
| 3rd row | In a small town in Canada, Clara Callan reluctantly takes leave of her sister, Nora, who is bound for New York. |
| 4th row | In a small town in Canada, Clara Callan reluctantly takes leave of her sister, Nora, who is bound for New York. |
| 5th row | In a small town in Canada, Clara Callan reluctantly takes leave of her sister, Nora, who is bound for New York. |
| Value | Count | Frequency (%) |
| the | 1041929 | 5.6% |
| of | 713907 | 3.8% |
| a | 698979 | 3.8% |
| and | 631892 | 3.4% |
| to | 429704 | 2.3% |
| 9 | 399202 | 2.2% |
| in | 345630 | 1.9% |
| her | 226898 | 1.2% |
| is | 210285 | 1.1% |
| for | 162808 | 0.9% |
| Other values (155678) | 13681897 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 16187576 | 14.4% |
| e | 10716585 | 9.5% |
| a | 7205834 | 6.4% |
| t | 7065437 | 6.3% |
| n | 6550881 | 5.8% |
| i | 6532615 | 5.8% |
| o | 6526239 | 5.8% |
| r | 6122476 | 5.4% |
| s | 5904740 | 5.2% |
| h | 4278200 | 3.8% |
| Other values (456) | 35395714 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 112486297 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 112486297 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 112486297 |
Language
Categorical
Imbalance 
| Distinct | 33 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 2 |
| Mean length | 1.6131559 |
| Min length | 1 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | en |
|---|---|
| 2nd row | en |
| 3rd row | en |
| 4th row | en |
| 5th row | en |
Common Values
| Value | Count | Frequency (%) |
| en | 618505 | 60.0% |
| 9 | 398937 | 38.7% |
| de | 5725 | 0.6% |
| es | 3425 | 0.3% |
| fr | 3223 | 0.3% |
| it | 732 | 0.1% |
| nl | 238 | < 0.1% |
| da | 119 | < 0.1% |
| pt | 100 | < 0.1% |
| ca | 49 | < 0.1% |
| Other values (23) | 122 | < 0.1% |
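The 78.5% imbalance figure reported for `Language` is consistent with one minus the Shannon entropy of the class distribution, normalized by the maximum entropy for 33 classes. That reading is an assumption checked only against this one number. The sketch also assumes the 122 rows in "Other values" are spread evenly over their 23 classes, which the report does not state but which barely affects the result:

```python
import math
from typing import Sequence

# Counts copied from the Common Values table.
top_counts = [618_505, 398_937, 5_725, 3_425, 3_223, 732, 238, 119, 100, 49]
other_total, other_classes = 122, 23

# Assumption: remaining rows are spread evenly across the 23 other classes.
counts = top_counts + [other_total / other_classes] * other_classes


def imbalance_score(class_counts: Sequence[float]) -> float:
    """One minus normalized Shannon entropy: 0 is balanced, 1 is a single class."""
    total = sum(class_counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in class_counts if c > 0)
    return 1 - entropy / math.log2(len(class_counts))


score = imbalance_score(counts)
print(f"imbalance: {100 * score:.1f}%")
```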
Most occurring characters
| Value | Count | Frequency (%) |
| e | 627663 | 37.7% |
| n | 618756 | 37.2% |
| 9 | 398937 | 24.0% |
| d | 5846 | 0.4% |
| s | 3437 | 0.2% |
| r | 3251 | 0.2% |
| f | 3225 | 0.2% |
| t | 839 | 0.1% |
| i | 736 | < 0.1% |
| l | 268 | < 0.1% |
| Other values (18) | 488 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1663446 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1663446 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1663446 |
Category
Text
| Distinct | 6448 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 64.8 MiB |
Length
| Max length | 279 |
|---|---|
| Median length | 118 |
| Mean length | 8.9121536 |
| Min length | 1 |
Unique
| Unique | 2426 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | ['Social Science'] |
|---|---|
| 2nd row | ['Actresses'] |
| 3rd row | ['Actresses'] |
| 4th row | ['Actresses'] |
| 5th row | ['Actresses'] |
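The sample rows show that Category holds a list serialised as text, such as `['Social Science']`, which is why brackets and quotes rank among the most frequent characters for this variable. A minimal sketch of recovering the labels with the standard library follows. The helper name `parse_categories` is hypothetical, and treating `9` as a placeholder for a missing value is an assumption.

```python
import ast

def parse_categories(raw):
    """Parse a stringified list such as "['Fiction']" into a list of strings."""
    if raw is None:
        return []
    text = raw.strip()
    if text == "" or text == "9":  # assumption: "9" is a placeholder for missing
        return []
    try:
        value = ast.literal_eval(text)
    except (ValueError, SyntaxError):
        return [text]  # not a Python literal: keep the raw text as one label
    if isinstance(value, (list, tuple)):
        return [str(item) for item in value]
    return [str(value)]

print(parse_categories("['Social Science']"))
print(parse_categories("9"))
```

`ast.literal_eval` only evaluates Python literals, so it is a safer choice than `eval` for text read from a data file.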
| Value | Count | Frequency (%) |
| fiction | 433139 | 34.1% |
| 9 | 406102 | 32.0% |
| (blank) | 48094 | 3.8% |
| juvenile | 45410 | 3.6% |
| biography | 22621 | 1.8% |
| autobiography | 22549 | 1.8% |
| science | 10079 | 0.8% |
| humor | 9028 | 0.7% |
| history | 8520 | 0.7% |
| religion | 7307 | 0.6% |
| Other values (5677) | 256601 | 20.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 1248035 | 13.6% |
| i | 1170814 | 12.7% |
| o | 652437 | 7.1% |
| [ | 625074 | 6.8% |
| ] | 625074 | 6.8% |
| n | 614275 | 6.7% |
| t | 583967 | 6.4% |
| c | 530396 | 5.8% |
| F | 443764 | 4.8% |
| 9 | 406985 | 4.4% |
| Other values (102) | 2289169 | 24.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9189990 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9189990 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9189990 |
city
Text
Missing 
| Distinct | 14767 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 14103 |
| Missing (%) | 1.4% |
| Memory size | 64.4 MiB |
Length
| Max length | 47 |
|---|---|
| Median length | 37 |
| Mean length | 8.7001314 |
| Min length | 1 |
Unique
| Unique | 4855 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | stockton |
|---|---|
| 2nd row | timmins |
| 3rd row | ottawa |
| 4th row | sudbury |
| 5th row | toronto |
| Value | Count | Frequency (%) |
| san | 22513 | 1.8% |
| st | 17816 | 1.4% |
| city | 15594 | 1.2% |
| toronto | 15124 | 1.2% |
| louis | 13628 | 1.1% |
| new | 11591 | 0.9% |
| beach | 11030 | 0.9% |
| chicago | 9418 | 0.7% |
| ft | 9224 | 0.7% |
| little | 8617 | 0.7% |
| Other values (12983) | 1146436 | 89.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 878949 | 9.9% |
| e | 774237 | 8.7% |
| o | 748122 | 8.5% |
| n | 703817 | 8.0% |
| l | 637878 | 7.2% |
| r | 608874 | 6.9% |
| i | 550374 | 6.2% |
| t | 528345 | 6.0% |
| s | 502627 | 5.7% |
| c | 314316 | 3.6% |
| Other values (74) | 2601121 | 29.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8848660 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8848660 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8848660 |
state
Text
Missing 
| Distinct | 2123 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 22798 |
| Missing (%) | 2.2% |
| Memory size | 64.1 MiB |
Length
| Max length | 49 |
|---|---|
| Median length | 37 |
| Mean length | 8.6172324 |
| Min length | 1 |
Unique
| Unique | 767 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | california |
|---|---|
| 2nd row | ontario |
| 3rd row | ontario |
| 4th row | ontario |
| 5th row | ontario |
| Value | Count | Frequency (%) |
| california | 107500 | 9.2% |
| new | 69766 | 6.0% |
| texas | 44173 | 3.8% |
| ontario | 41455 | 3.5% |
| virginia | 34999 | 3.0% |
| florida | 34258 | 2.9% |
| missouri | 33007 | 2.8% |
| washington | 31956 | 2.7% |
| illinois | 30626 | 2.6% |
| york | 29790 | 2.5% |
| Other values (1913) | 713273 | 60.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1136556 | 13.1% |
| i | 1017765 | 11.7% |
| n | 899518 | 10.4% |
| o | 753906 | 8.7% |
| r | 611153 | 7.0% |
| e | 565023 | 6.5% |
| s | 546331 | 6.3% |
| l | 444416 | 5.1% |
| t | 376764 | 4.3% |
| c | 309364 | 3.6% |
| Other values (69) | 2028623 | 23.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8689419 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8689419 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8689419 |
country
Text
Missing 
| Distinct | 414 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 35374 |
| Missing (%) | 3.4% |
| Memory size | 59.2 MiB |
Length
| Max length | 43 |
|---|---|
| Median length | 3 |
| Mean length | 4.2359548 |
| Min length | 1 |
Unique
| Unique | 147 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | usa |
|---|---|
| 2nd row | canada |
| 3rd row | canada |
| 4th row | canada |
| 5th row | canada |
| Value | Count | Frequency (%) |
| usa | 746518 | 71.8% |
| canada | 93005 | 8.9% |
| united | 33346 | 3.2% |
| kingdom | 33078 | 3.2% |
| germany | 27665 | 2.7% |
| australia | 18239 | 1.8% |
| spain | 14989 | 1.4% |
| france | 10658 | 1.0% |
| portugal | 6984 | 0.7% |
| new | 5968 | 0.6% |
| Other values (405) | 49849 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1206904 | 28.6% |
| u | 811861 | 19.2% |
| s | 802584 | 19.0% |
| n | 257087 | 6.1% |
| d | 180767 | 4.3% |
| i | 133129 | 3.2% |
| e | 109059 | 2.6% |
| c | 108524 | 2.6% |
| r | 87496 | 2.1% |
| t | 76864 | 1.8% |
| Other values (43) | 443893 | 10.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4218168 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4218168 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4218168 |
Interactions
Correlations
| | Language | Unnamed: 0 | age | rating | user_id | year_of_publication |
|---|---|---|---|---|---|---|
| Language | 1.000 | 0.041 | 0.019 | 0.013 | 0.011 | 0.289 |
| Unnamed: 0 | 0.041 | 1.000 | 0.039 | -0.042 | 0.170 | -0.158 |
| age | 0.019 | 0.039 | 1.000 | -0.018 | -0.013 | -0.009 |
| rating | 0.013 | -0.042 | -0.018 | 1.000 | -0.044 | 0.052 |
| user_id | 0.011 | 0.170 | -0.013 | -0.044 | 1.000 | -0.013 |
| year_of_publication | 0.289 | -0.158 | -0.009 | 0.052 | -0.013 | 1.000 |
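To read the matrix quickly, it helps to rank the off-diagonal entries by magnitude. The sketch below copies the coefficients from the table above and lists the strongest pairs. The helper name `strongest_pairs` is hypothetical, and the snippet makes no assumption about which correlation measure produced the numbers.

```python
names = ["Language", "Unnamed: 0", "age", "rating", "user_id", "year_of_publication"]
matrix = [
    [1.000, 0.041, 0.019, 0.013, 0.011, 0.289],
    [0.041, 1.000, 0.039, -0.042, 0.170, -0.158],
    [0.019, 0.039, 1.000, -0.018, -0.013, -0.009],
    [0.013, -0.042, -0.018, 1.000, -0.044, 0.052],
    [0.011, 0.170, -0.013, -0.044, 1.000, -0.013],
    [0.289, -0.158, -0.009, 0.052, -0.013, 1.000],
]

def strongest_pairs(names, matrix, top=3):
    """Return the `top` off-diagonal pairs ranked by absolute coefficient."""
    pairs = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):  # upper triangle only: matrix is symmetric
            pairs.append((names[i], names[j], matrix[i][j]))
    pairs.sort(key=lambda pair: abs(pair[2]), reverse=True)
    return pairs[:top]

top_pairs = strongest_pairs(names, matrix)
for left, right, value in top_pairs:
    print(f"{left} ~ {right}: {value:+.3f}")
```

No pair reaches 0.3 in magnitude. The two pairs involving `Unnamed: 0` most likely reflect row ordering rather than a meaningful relationship, since that column is a strictly increasing index.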
Missing values
Sample
First rows
| | Unnamed: 0 | user_id | location | age | isbn | rating | book_title | book_author | year_of_publication | publisher | img_s | img_m | img_l | Summary | Language | Category | city | state | country |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 2 | stockton, california, usa | 18.0000 | 0195153448 | 0 | Classical Mythology | Mark P. O. Morford | 2002.0 | Oxford University Press | http://images.amazon.com/images/P/0195153448.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/0195153448.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/0195153448.01.LZZZZZZZ.jpg | Provides an introduction to classical myths placing the addressed\ntopics within their historical context, discussion of archaeological\nevidence as support for mythical events, and how these themes have\nbeen portrayed in literature, art, ... | en | ['Social Science'] | stockton | california | usa |
| 1 | 1 | 8 | timmins, ontario, canada | 34.7439 | 0002005018 | 5 | Clara Callan | Richard Bruce Wright | 2001.0 | HarperFlamingo Canada | http://images.amazon.com/images/P/0002005018.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.LZZZZZZZ.jpg | In a small town in Canada, Clara Callan reluctantly takes leave of her\nsister, Nora, who is bound for New York. | en | ['Actresses'] | timmins | ontario | canada |
| 2 | 2 | 11400 | ottawa, ontario, canada | 49.0000 | 0002005018 | 0 | Clara Callan | Richard Bruce Wright | 2001.0 | HarperFlamingo Canada | http://images.amazon.com/images/P/0002005018.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.LZZZZZZZ.jpg | In a small town in Canada, Clara Callan reluctantly takes leave of her\nsister, Nora, who is bound for New York. | en | ['Actresses'] | ottawa | ontario | canada |
| 3 | 3 | 11676 | n/a, n/a, n/a | 34.7439 | 0002005018 | 8 | Clara Callan | Richard Bruce Wright | 2001.0 | HarperFlamingo Canada | http://images.amazon.com/images/P/0002005018.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.LZZZZZZZ.jpg | In a small town in Canada, Clara Callan reluctantly takes leave of her\nsister, Nora, who is bound for New York. | en | ['Actresses'] | NaN | NaN | NaN |
| 4 | 4 | 41385 | sudbury, ontario, canada | 34.7439 | 0002005018 | 0 | Clara Callan | Richard Bruce Wright | 2001.0 | HarperFlamingo Canada | http://images.amazon.com/images/P/0002005018.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.LZZZZZZZ.jpg | In a small town in Canada, Clara Callan reluctantly takes leave of her\nsister, Nora, who is bound for New York. | en | ['Actresses'] | sudbury | ontario | canada |
| 5 | 5 | 67544 | toronto, ontario, canada | 30.0000 | 0002005018 | 8 | Clara Callan | Richard Bruce Wright | 2001.0 | HarperFlamingo Canada | http://images.amazon.com/images/P/0002005018.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.LZZZZZZZ.jpg | In a small town in Canada, Clara Callan reluctantly takes leave of her\nsister, Nora, who is bound for New York. | en | ['Actresses'] | toronto | ontario | canada |
| 6 | 6 | 85526 | victoria, british columbia, canada | 36.0000 | 0002005018 | 0 | Clara Callan | Richard Bruce Wright | 2001.0 | HarperFlamingo Canada | http://images.amazon.com/images/P/0002005018.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.LZZZZZZZ.jpg | In a small town in Canada, Clara Callan reluctantly takes leave of her\nsister, Nora, who is bound for New York. | en | ['Actresses'] | victoria | british columbia | canada |
| 7 | 7 | 96054 | ottawa, ontario, canada | 29.0000 | 0002005018 | 0 | Clara Callan | Richard Bruce Wright | 2001.0 | HarperFlamingo Canada | http://images.amazon.com/images/P/0002005018.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.LZZZZZZZ.jpg | In a small town in Canada, Clara Callan reluctantly takes leave of her\nsister, Nora, who is bound for New York. | en | ['Actresses'] | ottawa | ontario | canada |
| 8 | 8 | 116866 | ottawa, , | 34.7439 | 0002005018 | 9 | Clara Callan | Richard Bruce Wright | 2001.0 | HarperFlamingo Canada | http://images.amazon.com/images/P/0002005018.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.LZZZZZZZ.jpg | In a small town in Canada, Clara Callan reluctantly takes leave of her\nsister, Nora, who is bound for New York. | en | ['Actresses'] | ottawa | , | NaN |
| 9 | 9 | 123629 | kingston, ontario, canada | 34.7439 | 0002005018 | 9 | Clara Callan | Richard Bruce Wright | 2001.0 | HarperFlamingo Canada | http://images.amazon.com/images/P/0002005018.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/0002005018.01.LZZZZZZZ.jpg | In a small town in Canada, Clara Callan reluctantly takes leave of her\nsister, Nora, who is bound for New York. | en | ['Actresses'] | kingston | ontario | canada |
Last rows
| | Unnamed: 0 | user_id | location | age | isbn | rating | book_title | book_author | year_of_publication | publisher | img_s | img_m | img_l | Summary | Language | Category | city | state | country |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1031165 | 1031165 | 278843 | pismo beach, california, usa | 28.0 | 1874061149 | 0 | The Queen's Gambit | Walter Tevis | 1996.0 | Texas Bookman | http://images.amazon.com/images/P/1874061149.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/1874061149.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/1874061149.01.LZZZZZZZ.jpg | Engaging and fast-paced, The Queen's Gambit speeds to a conclusion\nas elegant and satisfying as a mate in four. | en | ['Fiction'] | pismo beach | california | usa |
| 1031166 | 1031166 | 278849 | georgetown, ontario, canada | 23.0 | 0920656307 | 0 | Secret of Willow Castle | Ly Cook | 1911.0 | Firefly Books Ltd | http://images.amazon.com/images/P/0920656307.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/0920656307.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/0920656307.01.LZZZZZZZ.jpg | Canadian story early 19th century Orphaned servant girl sent to farm. | en | ['Canada'] | georgetown | ontario | canada |
| 1031167 | 1031167 | 278851 | dallas, texas, usa | 33.0 | 0028630289 | 0 | Frommer's 2000 California (Frommer's California 2000) | Erika Lenkert | 1999.0 | Frommer's | http://images.amazon.com/images/P/0028630289.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/0028630289.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/0028630289.01.LZZZZZZZ.jpg | 9 | 9 | 9 | dallas | texas | usa |
| 1031168 | 1031168 | 278851 | dallas, texas, usa | 33.0 | 0312266448 | 0 | The Military Quotation Book : Revised and Expanded: More than 1,200 of the Best Quotations About War, Leadership, Courage, Victory, and Defeat | James Charlton | 2002.0 | Thomas Dunne Books | http://images.amazon.com/images/P/0312266448.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/0312266448.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/0312266448.01.LZZZZZZZ.jpg | Contains more than 1,200 quotations about war, courage, combat,\nvictory, and defeat, by such people as Charles Dickens, George Patton,\nWinston Churchill, Voltaire, and Colin Powell. | en | ['Reference'] | dallas | texas | usa |
| 1031169 | 1031169 | 278851 | dallas, texas, usa | 33.0 | 067161746X | 7 | The Bachelor Home Companion: A Practical Guide to Keeping House Like a Pig | P.J. O'Rourke | 1987.0 | Pocket Books | http://images.amazon.com/images/P/067161746X.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/067161746X.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/067161746X.01.LZZZZZZZ.jpg | A tongue-in-cheek survival guide for single people reveals the\nquintessential secrets of no-fuss housekeeping | en | ['Humor'] | dallas | texas | usa |
| 1031170 | 1031170 | 278851 | dallas, texas, usa | 33.0 | 0743203763 | 0 | As Hogan Said . . . : The 389 Best Things Anyone Said about How to Play Golf | Randy Voorhees | 2000.0 | Simon & Schuster | http://images.amazon.com/images/P/0743203763.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/0743203763.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/0743203763.01.LZZZZZZZ.jpg | Golf lovers will revel in this collection of tips, wisdom, and\nquotations culled from the masters of the game, including Bobby Jones,\nJack Nichlaus, Sam Snead, Tom Watson, and Tiger Woods. 60,000 first\nprinting. | en | ['Humor'] | dallas | texas | usa |
| 1031171 | 1031171 | 278851 | dallas, texas, usa | 33.0 | 0767907566 | 5 | All Elevations Unknown: An Adventure in the Heart of Borneo | Sam Lightner | 2001.0 | Broadway Books | http://images.amazon.com/images/P/0767907566.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/0767907566.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/0767907566.01.LZZZZZZZ.jpg | A daring twist on the travel-adventure genre that places the talented\nLightner in the ranks of authors such as Jon Krakauer, Sebastian\nJunger, and Redmond O'Hanlon, All Elevations Unknown is ultimately\nthe remarkable story of two ... | en | ['Nature'] | dallas | texas | usa |
| 1031172 | 1031172 | 278851 | dallas, texas, usa | 33.0 | 0884159221 | 7 | Why stop?: A guide to Texas historical roadside markers | Claude Dooley | 1985.0 | Lone Star Books | http://images.amazon.com/images/P/0884159221.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/0884159221.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/0884159221.01.LZZZZZZZ.jpg | 9 | 9 | 9 | dallas | texas | usa |
| 1031173 | 1031173 | 278851 | dallas, texas, usa | 33.0 | 0912333022 | 7 | The Are You Being Served? Stories: 'Camping In' and Other Fiascoes | Jeremy Lloyd | 1997.0 | Kqed Books | http://images.amazon.com/images/P/0912333022.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/0912333022.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/0912333022.01.LZZZZZZZ.jpg | These hilarious stories by the creator of public television's\nlongest-running hit series capture the wacky sensibility and off-the-\nwall humor of the British sitcom. | en | ['Fiction'] | dallas | texas | usa |
| 1031174 | 1031174 | 278851 | dallas, texas, usa | 33.0 | 1569661057 | 10 | Dallas Street Map Guide and Directory, 2000 Edition | Mapsco | 1999.0 | American Map Corporation | http://images.amazon.com/images/P/1569661057.01.THUMBZZZ.jpg | http://images.amazon.com/images/P/1569661057.01.MZZZZZZZ.jpg | http://images.amazon.com/images/P/1569661057.01.LZZZZZZZ.jpg | 9 | 9 | 9 | dallas | texas | usa |
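The city, state, and country columns appear to be derived from `location`. In the sample, `n/a, n/a, n/a` becomes three missing values, while `ottawa, ,` yields a state of `,` and a missing country, which suggests the original split did not handle empty fields cleanly. A minimal sketch of a more defensive split follows. The helper name `split_location` and the set of tokens treated as missing are assumptions.

```python
MISSING_TOKENS = {"", "n/a"}  # assumption: tokens that should count as missing

def split_location(location):
    """Split 'city, state, country' into a 3-tuple, using None for missing parts."""
    parts = [part.strip().lower() for part in location.split(",")]
    parts = (parts + [""] * 3)[:3]  # pad or truncate to exactly three fields
    return tuple(None if part in MISSING_TOKENS else part for part in parts)

for raw in ["stockton, california, usa", "n/a, n/a, n/a", "ottawa, ,"]:
    print(raw, "->", split_location(raw))
```

Truncating to three fields is a simplification: a location with extra commas would lose its trailing parts, so real data may need a rule for which fields to merge.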